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1 Introduction 

The formula for the entropy of a density matrix, 

S(p) = S vN (p) = -k B Tipln(p), (1) 

introduced by von Neumann |vN29j . has been successfully used in statistical 
mechanics dealing with equilibrium states of large systems. The enormous success 
of its use has the effect, that questions about a basic justification have been posed 
only lately. The same is to be said about Shannon's formula, put forward in 
[S48t ISW49j for the entropy of classical information; but see [AFN74] . 

We see several reasons to "revisit" the basic axiomatics. One is to ask for 
counterparts to Lieb-Yngvason's axioms for thermodynamic entropy and the sec- 
ond law of thermodynamics, [LY98[ ILY991 ILYOOt ILY03] . Then the development 
of quantum engineering, handling of nano-physics and emergence of quantum- 
information leads to a new interest in basic theories of entropy-like functionals, 
characterizing states according to their "mixedness" , giving some kind of distance 
to the pure states. This characterization needs not a priori to be in relation to 
thermodynamics. 

Not only many aims are new, also the foundations both of quantum mechanics 
and of thermodynamics are now seen differently, in another light than in former 
times. Entanglement and decoherence may be considered as a special origin of 
the second law of thermodynamics, as is done f.e. in [PSW06J, and explanations 
of concepts by referring to our "natural" classical intuition, as in [075J which 
follows on |AFN74j . seem questionable. These classical notions refer to consid- 
erations on building up a system from its parts. But the present paper is built 
on the basic concepts of probability as expressed in density matrices. The funda- 
mental relation of probability to the physical world, irrespective of philosophical 
interpretations, is through the law of large numbers. Contrary to considering 
a state of a system as made up of different parts we consider unlimited repet- 
itive appearance of one and the same state, drawing a characterization of the 
single state out of its appearance in large numbers. After the presentation of 
formal settings and of the axioms in the two following sections, a discussion of 
deeper thoughts on epistemological considerations and reasoning follows. Also 
the backgrounds in history and in more recent developments are included in this 
discourse. 

A word on "axioms" : A set of axioms can either define a framework for a whole 
class of theories, (like Kolmogoroff 's axioms for probability), or it can, in the other 
extreme, present the postulates building up one special theory (like Newton's 
axioms for mechanics or Peano's axioms for the natural numbers). While the 
Lieb-Ynvason axioms are rather of the first kind, the axioms presented here are 
a sequence of increasingly detailed postulates leading in several steps to the von 
Neumann entropy. 
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2 The axioms and their framework 
2.1 Setup 

Non-relativistic quantum mechanics is the framework. Each physical system A, 
B, ... is represented by a separable Hilbert space J£% Jtfg, . . . , states are repre- 
sented by density matrices p^., pg, .... Superselection rules like conservation of 
particle numbers give a decomposition into superselection sectors M'a = @^ J&A,e, 
with restricting density matrices to be of block-diagonal form, pa = PA,e, for- 
bidding non-zero off-diagonal matrix elements connecting different sectors. 

A compound system A U B consisting of distinguishable subsystems A, B 
is represented by M'a © A superselection sector for system A is defined 

as a space spanned by all the eigenvectors belonging to an eigenvalue q of an 
operator Qa, or to a set of eigenvalues q a of several operators Q a ,A, like particle 
numbers and charges. In a compound system the particles, charges, etc. may 
be distributed over the subsystems. A sector there is defined as a subspace 
characterized by the eigenvalues of Q a = Q a ,A © 1 © 1 © Q a ,B- 

Considering compound systems consisting of several versions of the same sys- 
tem, we denote the n-fold tensor products as Jff® n . If each one of the subsystems 
is in the same state, and when there are no correlations between them, we denote 
the composed state as p|| n . 

In the course of stating axioms and developing their implications we use firstly 
systems with Hilbert spaces where all superselection numbers are specified, like 
"a box with three atoms of gold and one atom of lead" . I name such a space an 
"undivided" Hilbert space. Considering transformations of states, either through 
natural or technically manipulated time evolution, or as hypothetical reversible 
mappings, each unitary transformation p 1— > a = U pU* on such an undivided 
Hilbert space is admissible. Acting in a Hilbert space with several sectors, to be 
consistent with the superselection rules the unitaries have to commute with all 
the Q a . From standard quantum theories of particles we take the existence of an 
undivided Hilbert space with infinite dimension for granted. 

We distinguish a special type of density matrix ir(N) := jjPn, with P/v a 
projector onto an undivided (sub)space with dimension N. Such a ir(N) is the 
quantized version of distributions introduced by Laplace in the theory of chance 
and by Boltzmann in Statistical Mechanics; we call these ir(N) the "Quantum 
Laplace Boltzmann (QLB) states". 

When it comes to the details concerning the law of large numbers, the partial 
traces are needed to form the essential one-system density matrices u> defined on 
Jt? : out of a n-system density matrix f2 defined on . 

to(k) := Tl* p ar tial, k excluded {Q) (2) 

Finally, to get the best possible kind of continuity (which will turn out to be only 
a lower semicontinuity), we refer to the topology of states induced by the trace 
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norm of density matrices. 

Now the entropy S shall be a non-negative function on the set of states. 
(A discussion on this simplifying assumption is presented in Section EJ) This 
function S shall obey the following 

2.2 Axioms 

2.2.1 Equivalences 

If a compound system is considered where one part of it is in a pure state like 
Pb = \4'b){' 1 Pb\, the entropy of the whole system is determined by the other part 
(or parts) only, as if part B where absent: 

A) Decomposition: S(p A ® \^b){^b\) = S(pa), SflVuXVul® Pb) = S(p B )- 

Each unitary transformation of states, which commutes with particle number 
operators and other superselection rules, conserves entropy: 

B) Unitary invariance: S(p^) = S{oj) if °A — U-p^-U*, [U,Q a ] = Va. 

These two axioms imply universality, they enable comparisons of all systems 
with undivided Hilbert spaces: One may choose one system with an infinite 
dimensional undivided Hilbert space (without an index) as a reference system, 
and a pure state \ip)(ip\ with ip G Jf. For any p A on the undivided Jt?^ one has 
S{pa) — S(pa ® IV'KV'Dj and this compound state on Jtf^ (g) Jff, which is also 
undivided, may be unitarily transformed to \ipA) with any ip^ e One 

gets S{pX) = S(p), now with p a density matrix on the referencial Hilbert space 
Jif, equivalent to p^ by a (partial) isometry. So S does not depend on the type 
of matter. One may compare the entropy of a system with three atoms of gold in 
one box with the entropy of two atoms of lead in another box. Entropy of such 
a state is a mathematical function of the spectral values (including multiplicity) 
of density operators only. It is also independent on the dimension of the Hilbert 
space. 

2.2.2 QLB states and repeated appearances 

Two QLB states 7r(M) and tc(N) are equivalent in the sense of axioms A and B 
when M = N. Therefore their entropy is characterized by the dimension of their 
range only. It is "natural" to consider here strict monotonicity: 

C) Monotonicity: M > N =>• S(ir(M)) > S(ir(N)). 

A compound system, consisting of n equivalent systems in completely equiv- 
alent states, without any correlation, shall have n times the entropy of each of 
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its parts: 

D) Extensivity, discrete scaling: S(p® n ) = n ■ S(p) 

Now we have excluded any trivial S, and we can begin to justify the restriction 
of S to non-negative values: Note that 7r(l) is a pure state, by axiom A it is 
equivalent to 7r(l) <S> tt(1), so discrete scaling gives 

5(tt(1)) = 5(tt(1) <g> tt(1)) = 2 ■ 5(tt(1)). 

Attribution of oo is impossible, since other S(tt(N)) have to be strictly larger, 
— oo has been excluded a priori, so ^(^(l)) = remains as the only allowed value. 

A "calibration" is possible, setting S(tt(2)) = k B ■ In 2, and we can prove 
Boltzmann's formula (written down by Planck) for the QLB states as a strict 
consequence of these axioms: 

1 THEOREM. The Boltzmann- Entropy. Assuming axioms A, ... D, and 
choosing some universal constant k B , the Boltzmann-Planck formula for the en- 
tropy of QLB states holds: 

S(tt(N)) = k B ■ InN. (3) 

Proof. Observe that 7r(N)® n = 7r(N n ), and consider numbers m, n, obeying 
2 m < N n < 2 m+1 . Then m ■ k B ■ In 2 < n ■ S(n(N)) < (m + 1) • k B ■ In 2. In the 
limit of large numbers, fixed, one gets 

SMN)) = ( lim — V k B ■ ln2 = k B ■ hiN. 

\n-s>oo n / 

□ 

2.2.3 Generality via Large Numbers 

The properties of entropy shall reflect the law of large numbers. It states that 
repeated appearance of the same state, without correlations, becomes similar to 
a certain compound state with discrete uniform distribution. In subsection 13.41 
we present details, justifying the following strict condition on the function S. 

E) Rational combinatorics with large numbers: 

If the density matrix p has finite rank and only rational numbers as eigenvalues, 
consider numbers n which are common multiples of their denominators and con- 
sider QLB states fl with range in M'® n which simulate the state p® n in such a 
way, that each partial trace uj{k) of Q, as defined in equ.(J2]), gives exactly p. The 
entropy of p® n which, by axiom D, is n times the entropy of p, shall dominate 
the entropy of each of these fl, but it shall not be larger then necessary for this 
dominance: 
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S(p) := sup <^ -S(Q) | range(O) C Q a QLB state, Wk u(k) = p } 

n,n {n J 

Those states for which entropy is already defined through axioms A . . . E form 
a dense subset of the set of states for each system. On Hilbert spaces with 
finite dimension the remaining infinitesimal gaps can be filled in in such a way, 
that S becomes a continuous function. In case of Jif being of infinite dimension 
the entropy S can at best only be extended to a lower semicontinuous function, 
including +00 in its range. 

F) Semicontinuity: S(p) is lower semicontinuous. 

S(p) := inf < lim S(a(n)) | a(n) — > p in trace norm, as n — > 00 \ 

<r(n) In— >oo ) 



2 THEOREM, von Neumann- Entropy. The only function fulfilling all the 
axioms is von Neumann's entropy, as written in (T7]j. 

Proof, a) S v n(p) is an upper bound for the entropy S(p) of each p considerd 
in E: The range of Q is, by the conditions on the partial traces, a subset of 
(range(p)) lX,n which equals range(p' x,n ). For simplicity we restrict the Hilbert 
space Jtf to range(p). There we define the operator L\ and on ® n its "second 
quantized form" L, 

L x :=\np, L:=^L ljfc , (4) 

k 

where Li^ acts in the k th factor of M ) ® n . The repeated appearance of p in every 
subsystem, VA; 6 {1 . . . n}: u{k) = p, implies 

Tt(Q -L)=n- Tr(p • Li) = Tr(p® n • L). (5) 

By Klein's inequality and the maximum entropy principle, |W78| IW90] . this im- 
plies S V N(p® n ) > S v n(SI)- We present a proof, adapted to the present situation: 
Klein's inequality leads to positivity of relative entropy (section I,B5 in |W78] . 
or 3.1 in (BS2], or |NC00j ): 

Tr(fi(lnft-lnp® n )) > 0. 

Note that (jlj) implies L = \np® n , and therefore, using also ([5]): 

S vN (Q) = -Tr(filnfi) < 

< -Tr{tt ■ lnp 0n ) = -Tr(fiL) = -Tr(p®"L) = -Tr(p® n In p 0n ) = S vN {p® n ) 
Now 5(0) = S vN (Q) and S vN (p® n ) = n ■ S vN (p). 
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b) For each p considered in E the supremum is reached in the limit n — > oo, 
giving S v n{p): Consider the limit of those large numbers n which are multiples 
of a common denominator of all the eigenvalues rj of p. Define Q as the state 
proportional to the projector onto the linear span of those product states formed 
with eigenstates <pj of Li, where each such <pj appears rrij times, where rrij = n-Tj. 

Each eigenvector <pj of p can be chosen as an element of a superselection 
sector, an eigenvector of all those operators Q a whose spectral projectors define 
the sectors. With Q a \4>j) = Qajlfy) eacn °f the chosen product states has the 
same eigenvalues 



These product states are all in the same sector of M'® n , so the constructed density 
matrix Q is one block, it is a QLB state for which the axioms A . . . D are valid. 
Combinatorics gives dim (range = N, with 



By Stirling's formula, the limit n — > oo gives lim^oo ^ In N = — rrij ln(mj). 

c) Filling the infinitesimal gaps: The p considered in axiom E form a dense subset 
in the set of all density matrices. On this subset the entropy takes on finite 
values. But if dim(^) = oo, the values are unbounded, and not continuous: For 
p = J2j r j ' \4>j){4>j\ consider the sequence p(N) = p — jj\<f>i) (<fii\ + -^^(N 2 ), with 
N > 1/ri. The sequence of von Neumann entropies of p(N) diverges. So von 
Neumann entropy, which is known to be lower semicontinuous, can at best be 
extended as a semicontinuous function, including +00 in its range. □ 

3 Discussion 

3.1 On the setup and on history 

How did the founding fathers proceed, and is there a relation between their 
thoughts and the present point of view? Of the fathers of Statistical Physics, 
Clausius, Maxwell, Boltzmann (at the time of the precursor Daniel Bernoulli a 
concept of entropy was not yet in existence), Boltzmann was the first one who 
tried to give an expression for thermodynamic entropy by atomistic statistical 
terms. It is not easy to follow his thoughts in depth, see [K731 IGRY08] . But 
he, and then Gibbs, Planck, Einstein and von Neumann, jBTTj lB72j IB77] lG02j 
IP011 IP06|. IE144 lvN29] . each one of them gave a justification of his theory by 
demonstrating its applicability in thermodynamics, mainly concerning an ideal 
gas. Even in von Neumann's book on the mathematical foundations of quantum 



Q\ty = ( Q a ,k) \<Pm) = E m i * 



k k j 




(6) 
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mechanics, |vN32] . entropy as a function of density matrices is presented only in 
the subchapter V2 "Thermo dynamische Betrachtungen" . 

Now the usefulness of their atomistic formulas giving entropy in Statistical 
Physics is without any doubt. But, in the hindsight, one may question the 
uniqueness: Is it possible to find another formula, working as well as the existing 
one? That's one of the questions where the analysis presented in this paper 
may help to find answers. Other questions may be asked in the emerging theory 
of Quantum Information. There is no inductive derivation of a quantity like 
entropy of quantum information from experiments and practice. But one can 
state "desiderata" how to characterize an appropriate measure for information 
carried by quantum states. I guess, they should be identical to those posed in 
Statistical Physics. 

We deal with general states in non-relativistic Quantum Mechanics. Entropy 
has in some way to indicate the "mixedness" of a mixed state. As is well known, 
quantum mechanical mixing is in general different from the classical mixing of 
probability distributions, in that the decomposition into pure states is not unique. 
But there are still cases of classical mixing appearing in quantum mechanics. Con- 
sider for example a single point in a lattice gas of fermions. There are only two 
pure states; the lattice point is either empty or occupied. No coherent superposi- 
tion is allowed. Quantumness appears only in a larger set of lattice points, where 
there exist pure states of a particle with a wave function which extends over 
several points. But, however large the system may be, there exists no coherent 
superposition of states with different numbers of particles. Such a superselection 
rule introduces a classicality into Quantum Physics. So, a priori, we have here 
to distinguish a fully quantum mechanical system with all mixed states out of 
the same superselection sector, from states where different sectors are involved in 
the mixing. It will come only through considerations of large numbers of parts, 
forming one large system, that a universal formula for entropy emerges, making 
no difference between the kind of mixing. The a priori difference has to be consid- 
ered when postulating the invariance of entropy under unitary transformations, 
done in the following subsection 13.21 

In the vast universe of probability distributions and of mixed quantum states 
there appears a special kind of mixing: Laplace, in establishing basic methods 
for dealing with probability, considered sets of events with equal probabilities, 
because of a principle of insufficient reason, later named "the principle of indif- 
ference" . Boltzmann, considering distribution functions on phase space, perform- 
ing a careful analysis of compatibility with Hamilton- Jacobi theory of classical 
mechanics, used equal a priori probability distributions on sets of given energy. 
Now, a QLB state 7r(N) is the quantized version of such a distribution. 

In analogy to the classical theories, one may interpret a QLB state as "each 
pure state characterized by a ip G range(7r(iV)) has the same probability". We 
don't want to build on special interpretations, we use the QLB states just as 
special states, discerned from others by their symmetry. This symmetry makes 
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handling them extremely simple. But that is again a case where we have to 
consider superselection rules. Mixed states with contributions from different sec- 
tors do not allow for as many decompositions into pure states as do mixed states 
which live in only one sector. It has less symmetry. So we start with these special 
QLB states with the highest possible symmetry. 

Regarding in a short detour extreme philosophical considerations, whether 
the world should be modeled with features of infinity, or not, there appears the 
case that we should consider models of a world where only sectors with finite 
dimension exist. Now even if the elementary systems have only one- dimensional 
sectors, as in the example of a lattice gas with fermions, quantum mechanics 
comes into play. This happens through extensions in the compound systems. 
One just has to exclude that the world is completely trivial, being only one- 
dimensional. We could actually perform our considerations with a model of the 
world where there exist only sectors of finite dimension, but without an upper 
bound. But, for simplicity, we assume the existence of a sector with infinite 
dimension. This sector can serve as the reference Hilbert space. 

In classical mechanics entropy can take on all values, between minus and plus 
infinity and including these values. The desiderata, as expressed in the axioms, 
do actually lead to strict positivity of the quantum mechanical entropy function, 
with the exception of pure states, see subsection 13.31 Again for simplicity we 
start already with demanding non-negativity of S(p). 

3.2 Equivalences, the way from physics to mathematics 

Decompositions of systems are, at least since Galileo Galilei, at the heart of 
modern physics. At this point we pose the minimal postulate, that parts of a 
compound system, which are in pure states, have no effect on the entropy, and 
may as well be absent. 

Thinking of applications in thermodynamics, such a part can represent, in 
some simplification, a weight which can be heaved up or lowered in a gravitational 
field. 

Comparability of states is an essential feature, it is the central point in the 
analysis of the second law and entropy in |LY99] . While Lieb and Yngvason pos- 
tulate first the existence of an order relation between states and regard "adiabatic 
equivalence" as derived, (p ~ a if both p -< a and a -< p), the procedure in the 
present paper starts with equivalences, axioms A and B. In this paper there is 
no explicit definition of a mathematical equivalence relation; talking now about 
equivalence of states, we refer to the applicability of these two axioms (which 
could be used for a precise definition). 

Comparison, inequalities for the entropy function (axiom C), is then formu- 
lated for special equivalence classes. This relation of equivalence is the point, 
where unitary evolutions and transformations of states come into play. Already 
prior to Heisenberg's and Schrodinger's creation of Quantum Mechanics it has 
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been stated by Einstein and Szilard, that the time evolution of mixed states 
should be considered with the assumption that pure states evolve into pure states. 
Einstein, in |E14] , calls this "Ehrenfests Adiabatenhypothese" ; von Neumann in 
[vN29] cites this and also page 777 in |S24] . where Szilard refers to his doctoral 
thesis, as precursors to his formulation of entropy 

At the time of creating quantum physics only time evolution as is given by na- 
ture had to be considered. Now we have the possibility of steering time evolution 
on an atomistic scale, [C TGOlT] , without observing contradictions to the Sec- 
ond Law of Thermodynamics. So we postulate that each unitary transformation 
which is consistent with superselection rules does not change entropy. 

If we think of an eager freshman in physics with little a priori knowledge, we 
guess that this freshman would intend to think that states describing spins of 
electrons should carry completely different amounts of entropy than mathemati- 
cally equivalent states describing spins of nuclei. But such a difference does not 
appear. Entropy is a mathematical property, independent of the kind of matter 
which is described by a density matrix. We regard this fact as remarkable and 
worth while to be derived from axioms. 

Axiom B paves the way for comparison by establishing equivalences of states 
in one sector, as, f.e. states of three atoms of gold in one box. Combining 
now the equivalences of unitarily transformed states (axiom B) with forming and 
reducing compound systems by adding and discarding pure states of another 
system (axiom A) lays threads of equivalent states through all sectors. A system 
with three atoms of gold has states which are equivalent to states of a system 
with five atoms of lead, and equivalent to some states of a system with two atoms 
of gold in a box. 

3.3 QLB states form the backbone 

QLB states are completely characterized by one discrete parameter, so they show 
a truly "natural" ordering, allowing for a "natural" inequality between their en- 
tropies. This inequality can also be supported by considering certain irreversible 
evolutions in physics, namely "mixing". If M < N, the QLB state tt(N) can 
be constructed by mixing several unitarily transformed ir(M). The irreversibility 
has to be represented in an increase of entropy. These ideas are the basis for 
Uhlmann's order relation for density matrices, and could be used for a slightly 
different set of axioms, see the Appendix, section HJ 

Axiom D, on strict rules for the entropy function in case of repeated uncorre- 
lated appearance of parts, is the first step to the law of large numbers. It is here, 
that one could exclude negativity: S(ti(N 2 )) = 2 • S(ir(N)) and monotonicity 
imply S(ir(N) > 0. Only the case S(7r(0)) = — oo) would need to be excluded a 
priori or with an extra axiom. 

Note that at this point the Boltzmann-Plank formula is derived for tt(N) 
states only, with the assumption that their range is in one sector. The proba- 
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bilistic distribution over different sectors has to be analyzed by using the law of 
large numbers. 

Choosing the extensivity may be considered as convenient, but not absolutely 
necessary. For example, demanding S(p® n ) = (S(p)) n instead, would lead to a 
different, but closely related formula. 



3.4 Flesh on the backbone by the law of large numbers 

The procedure of deriving the p log p- formula from equal a priori distributions is 
not new. There are only differences in the settings. Boltzmann considers atoms 
in a gas, Gibbs shows the equivalence of canonical and microcanonical ensembles. 
Each of the founding fathers used expression (jSJ) and Stirling's formula; and also 
did Schrodinger in |S46] . 

Here we consider states from the point of view of probability theory. Di- 
rectly justifying the interpretations of entropy as a measure of uncertainty or of 
information is difficult. (This has been noted already in [BR81J, chapter 6.2.3.) 
Nevertheless, entropy arises naturally in the formalism of the law of large num- 
bers. This law is rarely addressed explicitly in the context of defining entropy, but 
it appears implicitly, when "average information" is mentioned (f.e. in [ NCOO] ). 
It is basic to application of probability theory. (Jacob Bernoulli, who established 
the Law of Large Numbers, see f.e. |B01j . regarded the derivation of this theorem 
as a greater achievement than if he had shown how to square a circle, since a 
proof the latter would have been of little use.) 

This law says, roughly, that in a typical series of N measurements the number 
rii(N) of appearances of event i lies most probably in an interval 

[N-Cy/N,N + Cy/N]- Pi . (7) 

with pi the probability for appearance of event i, and c some constant. Our axiom 
E seems, at a first glance, to deviate in two ways from this law. It accounts for 
a number of possible series which seems to be in one sense too small, in another 
sense too large. The number seems to be too small, since we restrict the rii(N) 
to one definite value, instead of letting it vary in the interval ([7]). But, taking 
the logarithm, this difference becomes negligible. Making a step from N to 2N, 
all those series covered by the interval (jJJ) appear in a series of doubled length, 
where rii(2N) = 2Npi exactly. In the Boltzmann-Plank formula the logarithm is 
taken, and In 2 is negligible for large N. 

On the other hand, the dimension of QLB-states appears as being too large, 
as quantum correlations between the repeated appearances are allowed. As an 
example consider a spin up - spin down probability distribution 2/3,1/3 and 
a series of N = 3 tests. In the 8-dimensional Hilbert space, a 3-dimensional 
subspace is accounted for by combinatorics, as in formula (jSJ). It is spanned by 
the vector and its permuted analogues. The QLB state, according to the 
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conditions of axiom E, allows for one more state vector, namely (2/3) l l 2 ttt 
+e ia (l/3) 1 / 2 iU, where a is an arbitrary phase. Since already the restriction 
of the rii(N) to strict values introduces a kind of classical correlation between 
different appearances, I see no reason to exclude a quantum correlation. But 
these extra states can not lead to an excess over the von Neumann entropy, as is 
shown in part a in the proof of Theorem [2j 

4 Appendix: Uhlmann's order relation 

States allow for a partial ordering related to mixing. It is Uhlmann's order 
relation, \U71\ \U72\ IU73t IW74] . Note, that this order relation is not the ordering 
defined in [LY98j . In matrix analysis [B97J Uhlmann's order relation is known as 
majorization: 

p >- a means: p is more mixed than a\ p is majorized by a. The definition is 

N N 

pya iff ViV £}r ft <^s n , (8) 

n=l n=l 

where the r n and the s n are the eigenvalues of p and of a in decreasing order, 
r i > r 2 > r 3 ■ ■ •; and at least one of the inequalities is strict. (Note that 7r(M) 
is more mixed than n(N) if M > N.) With this ordering one could use different 
axioms: 

C) Monotonicity: p y o =>■ S(p) > S(a), strict for a with finite rank. 

In matrix analysis the "monotonicity" is known as "Schur- concavity" . Also this 
way we have excluded any trivial S, as above we conclude ^(^(l)) = 0. All other 
states are more mixed and must have larger entropy. For states not of QLB form, 
there are still several possibilities, the first four axioms do f.e. not yet exclude 
Renyi-entropies. So it is again the law of large numbers which leads strictly to 
von Neumann's formula. The axiom F of semicontinuity can then be replaced by 

F') : S(p):=sn Pa {S(cj)\pya}. 
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